In this paper we address a method to align English-Chinese bilingual newsreports from China News Service, combining both lexical and satisticalapproaches. Because of the sentential structure differences between English andChinese, matching at the sentence level as in many other works may result infrequent matching of several sentences en masse. In view of this, the currentwork also attempts to create shorter alignment pairs by permitting finermatching between clauses from both texts if possible. The current method isbased on statiscal correlation between sentence or clause length of both textsand at the same time uses obvious anchors such as numbers and place namesappearing frequently in the news reports as lexcial cues.
展开▼